Model Selection

Instruction Fine-Tuning Optimization

# Instruction Fine-Tuning Optimization

Granite 3.3 8b Instruct Q8 0 GGUF

This model is a GGUF format model converted from the IBM Granite-3.3-8B instruction fine-tuned model, suitable for text generation tasks.

Large Language Model

Opencodereasoning Nemotron 7B

OpenCodeReasoning-Nemotron-7B is a large language model developed based on Qwen2.5-7B-Instruct, focusing on code generation and reasoning tasks, supporting a context length of 32K tokens.

Large Language Model

Transformers Supports Multiple Languages

Llama SEA LION V3.5 70B R

Llama-SEA-LION-v3.5-70B-R is a hybrid-function large language model optimized for Southeast Asian languages, supporting 13 languages with capabilities in complex reasoning and general text generation.

Large Language Model

Transformers Supports Multiple Languages

Flan T5 Titlegen Springer

A model fine-tuned based on google/flan-t5-base, specifically designed for the abstractive summarization task of refining scientific abstracts into concise titles.

Text Generation

Transformers English

Qwen.qwen2.5 VL 3B Instruct GGUF

Qwen2.5-VL-3B-Instruct is a 3B-parameter vision-language model that supports image-to-text generation tasks.

Llama 3.1 8B SuperNova EtherealHermes GGUF

An 8B-parameter large language model based on the Llama-3.1 architecture, offering multiple quantized versions in GGUF format

Large Language Model English

T3Q Qwen2.5 14b V1.0 E3

A post-trained version based on the Qwen/Qwen2.5-14B-Instruct-1M model, using LoRA-8-4-0.0001-cosine-32-16 configuration, trained on train_data_v1.0.

Large Language Model

Transformers Supports Multiple Languages

Hymba 1.5B Instruct

A 1.5B-parameter model fine-tuned for instructions based on Hymba-1.5B-Base, capable of handling complex tasks such as mathematical reasoning, function calling, and role-playing

Large Language Model

Videollama2.1 7B 16F Base

VideoLLaMA2.1 is an upgraded version of VideoLLaMA2, focusing on enhancing spatiotemporal modeling and audio understanding capabilities in large video-language models.

Transformers English

Videollama2.1 7B 16F

VideoLLaMA 2 is a multimodal large language model focused on video understanding, equipped with spatiotemporal modeling and audio comprehension capabilities.

Transformers English

Llama 3.1 8B Dragonfly V2

Dragonfly is a multimodal vision-language model fine-tuned with instructions based on Llama 3.1, supporting joint understanding and generation of images and text

Image-to-Text English

togethercomputer

Mistral 7B V0.3

Mistral-7B-v0.3 is an upgraded large language model based on Mistral-7B-v0.2, with the main improvement being the expansion of the vocabulary to 32,768 tokens.

Large Language Model

Llama 3 Stinky V2 8B

This is an 8B-parameter model based on the Llama-3 architecture, merged using the mergekit tool, with strong text generation capabilities.

Large Language Model

Granite 8b Code Instruct 4k

Granite-8B-Code-Instruct-4K is an 8-billion-parameter code instruction model, fine-tuned on various permissible instruction datasets based on Granite-8B-Code-Base-4K, enhancing its ability to follow instructions, including logical reasoning and problem-solving skills.

Large Language Model

Transformers Other

Granite 3b Code Instruct 2k

Granite-3B-Code-Instruct-2K is a 3-billion-parameter model fine-tuned from Granite-3B-Code-Base-2K, with enhanced instruction-following capabilities, particularly excelling in code generation and logical reasoning tasks.

Large Language Model

Transformers Other

Turkcell LLM 7b V1

A Turkish large language model based on the Mistral 7B architecture, trained on 5 billion Turkish tokens and fine-tuned with instructions

Large Language Model

Transformers Other

Calme 7B Instruct V0.9

Calme-7B is a 7-billion-parameter language model fine-tuned based on Mistral-7B, excelling in generating clear, peaceful, and coherent text.

Large Language Model

Gemma 1.1 2b It

Gemma is a lightweight open model series launched by Google, built on the same technology as Gemini, suitable for various text generation tasks.

Large Language Model

Codellama 70b Instruct Hf

Code Llama is a series of code generation and understanding models released by Meta, ranging from 7 billion to 70 billion parameters. This model is the 70 billion parameter instruction fine-tuned version.

Large Language Model

Transformers Other

CausalLM/14B-DPO-α is a large-scale causal language model supporting Chinese and English text generation tasks, with outstanding performance in MT-Bench evaluations.

Large Language Model

Transformers Supports Multiple Languages

FinMA-7B-NLP is a large language model for the financial domain developed by the PIXIU project, specifically designed to understand complex financial terms and concepts, significantly improving performance in downstream financial tasks through natural language instruction fine-tuning.

Large Language Model

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase